منابع مشابه
Machine Learning to Improve a Document Pipeline
We describe a collaborative project between our research group and a small west-coast business to apply machine learning techniques to a document processing task. This experience suggests two key points: (1) even as machine learning and artificial intelligence matures, there are many business applications that have not yet exploited these techniques; and (2) academically well-established machin...
متن کاملGlobal discretization of continuous attributes as preprocessing for machine learning
Real-life data usually are presented in databases by real numbers. On the other hand, most inductive learning methods require a small number of attribute values. Thus it is necessary to convert input data sets with continuous attributes into input data sets with discrete attributes. Methods of discretization restricted to single continuous attributes will be called local, while methods that sim...
متن کاملDiscretization of Numerical Attributes Preprocessing for Machine Learning
Page 2 of 46 Abstract The area of Knowledge discovery and Data mining is growing rapidly. A large number of methods is employed to mine knowledge. Several of the methods rely of discrete data. However, most datasets used in real application have attributes with continuously values. To make the data mining techniques useful for such datasets, discretization is performed as a preprocessing step o...
متن کاملPreprocessing Input Data for Machine Learning by FCA
The paper presents an utilization of formal concept analysis in input data preprocessing for machine learning. Two preprocessing methods are presented. The first one consists in extending the set of attributes describing objects in input data table by new attributes and the second one consists in replacing the attributes by new attributes. In both methods the new attributes are defined by certa...
متن کاملPreprocessing of tandem mass spectra using machine learning methods
Protein identification has been more helpful than before in the diagnosis and treatment of many diseases, such as cancer, heart disease and HIV. Tandem mass spectrometry is a powerful tool for protein identification. In a typical experiment, proteins are broken into small amino acid oligomers called peptides. By determining the amino acid sequence of several peptides of a protein, its whole ami...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Frontiers in Immunology
سال: 2021
ISSN: 1664-3224
DOI: 10.3389/fimmu.2021.677870